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Feline coronavirus (FCoV) infections are endemic amongst cats worldwide. The 
majority of infections are asymptomatic, or result only in mild enteric disease. However, 
approximately 5% of cases develop feline infectious peritonitis (FIP), a systemic 
disease that is a frequent cause of death in young cats. In this study, we report the 
complete coding genome sequences of six FCoVs; three from fecal samples from 
healthy cats and three from tissue lesion samples from cats with confirmed FIP. The 
six samples were obtained over a period of eight weeks at a single-site cat rescue and 
rehoming center in the UK. We found amino acid differences are located at 44 
positions across an alignment of the six virus translatomes and, at 21 of these 
positions, the differences fully or partially discriminate between the genomes derived 
from the fecal samples and the genomes derived from tissue lesion samples. In this 
study, two amino acid differences fully discriminate the two classes of genomes; these 
are both located in the S2 domain of the virus surface glycoprotein gene. We also 
identified deletions in the 3c protein ORF of genomes from two of the FIP samples. Our 
results support previous studies that implicate S protein mutations in the pathogenesis 
of FIP. 
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Summary 


25 Feline coronavirus (FCoV) infections are endemic amongst cats worldwide. The 

majority of infections are asymptomatic, or result only in mild enteric disease. Flowever, 
approximately 5% of cases develop feline infectious peritonitis (FIP), a systemic disease that 
is a frequent cause of death in young cats. In this study, we report the complete coding 
genome sequences of six FCoVs; three from fecal samples from healthy cats and three from 
30 tissue lesion samples from cats with confirmed FIP. The six samples were obtained over a 
period of eight weeks at a single-site cat rescue and rehoming center in the UK. We found 
amino acid differences are located at 44 positions across an alignment of the six virus 
translatomes and, at 21 of these positions, the differences fully or partially discriminate 
between the genomes derived from the fecal samples and the genomes derived from tissue 
35 lesion samples. In this study, two amino acid differences fully discriminate the two classes of 
genomes; these are both located in the S2 domain of the virus surface glycoprotein gene. 
We also identified deletions in the 3c protein ORF of genomes from two of the FIP samples. 
Our results support previous studies that implicate S protein mutations in the pathogenesis of 
FIP. 

40 
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Introduction 


45 Coronaviruses are enveloped, positive-stranded RNA viruses. They are generally 

responsible for mild enteric and respiratory infections but they can also be associated with 
severe disease in both humans and animals (Masters & Perlman, 2013). Coronaviruses are 
now recognized as emerging viruses with a propensity to cross into new host species, as has 
been shown by the recent outbreaks of severe acute respiratory syndrome and Middle East 
50 respiratory syndrome (Coleman & Frieman, 2014). As illustrated in Fig. 1 for feline 
coronavirus (FCoV), two-thirds of the coronavirus genome encodes proteins involved in viral 
RNA synthesis. The majority of these proteins are encoded in two 5'-proximal, overlapping 
open reading frames (ORFs), ORFIa and ORFIb, and are translated as polyproteins, ppla 
and pplab, which are then processed by virus-encoded proteinases into 16 nonstructural 
55 proteins (Ziebuhr, 2005) . The remainder of the genome encodes the virus structural proteins 
(S, E, M and N) as well as accessory proteins that are not essential for replication in cell 
culture. The structural and accessory proteins are translated from a 3’ co-terminal nested set 
of subgenomic mRNAs (Perlman & Netland, 2009). 

The coronavirus surface or spike (S) glycoprotein is a typical class 1 viral fusion 
60 protein and it has a central role in the biology of coronavirus infection. Structurally, the 
protein can be divided in to an amino-proximal half, the SI domain, which contains the 
receptor-binding domain and a carboxyl-proximal half, the S2 domain, which contains 
elements involved in membrane fusion. These elements include heptad repeats, a fusion 
peptide and a carboxyl-terminal, hydrophobic transmembrane domain (Heald-Sargent & 

65 Gallagher, 2012). In many coronaviruses, the SI and S2 domains are cleaved from each 

other by a cellular, furin-like enzyme (de Plaan etal., 2004). The S protein is also the location 
of both B cell and T cell epitopes that are important in virus neutralization and the recognition 
of virus-infected cells (Reguera et al., 2012; Satoh et at., 2011). 
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70 


FCoVs form two antigenically distinct serotypes; serotype 1, which are difficult to 
propagate in cell culture, and serotype 2, which are the consequence of a double 
recombination between type 1 FCoV 1 and canine coronavirus (Flerrewegh et al., 1998) and 
are relatively easy to propagate in cell culture. FCoV infections are endemic amongst cats 
worldwide, and serological and molecular studies confirm that serotype 1 FCoVs 
75 predominate (Pedersen, 2014b). In the United Kingdom about 40% of domestic cats have 
been infected with FCoV and in multicat households this figure increases to almost 90% 
(Addie, 2000; Addie & Jarrett, 1992). The majority of FCoV infections are asymptomatic, or 
result only in mild enteric disease. However, approximately 5% of infected cats develop feline 
infectious peritonitis (FIP), a systemic inflammatory disease that is a frequent cause of death 
80 in young cats (Kipar & Meli, 2014). Currently, there is no protective vaccine or effective 
treatment for FIP (Pedersen, 2009; 2014a). 

The most important questions in FCoV research are why some infected animals 
remain relatively healthy, whilst others develop FIP, and what is the role of the virus in the 
development of disease. It is now widely accepted that, in the vast majority of cases, cats are 
85 infected by the fecal-oral route with avirulent FCoV strains circulating in the cat population. 
Initially, this virus replicates predominantly in the intestinal epithelium and is shed with the 
feces. Nonetheless, it often leads to systemic infection via monocyte-associated viremia 
(Kipar et al., 2010; Meli et al., 2004; Porter et al., 2014). At this stage, however, the systemic 
infection is characterized by a relatively low level of virus replication and infection can be 
90 maintained for a prolonged period of time, possibly involving recurrent viremic events, 
without apparent disease (Kipar et al., 2010). During replication in the intestine or, potentially, 
within monocytes/macrophages (Pedersen et al., 2012), the virus undergoes mutation and 
viruses with an enhanced tropism for monocytes and macrophages emerge. The altered 

1 In this paper, unless otherwise stated, FCoV will be used to mean serotype 1 FCoV. FCoV 
is also used as a strain designation for the species Alphacoronavirus 1 in the genus 
Alphacoronavirus, family Coronaviridae. 
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tropism of these mutants results in their ability to maintain effective and sustainable 
95 replication in monocytes (Dewerchin et al., 2005). As a direct or indirect result of a higher 
level of virus replication, this now apparently virulent virus leads to activation of monocytes 
(Regan et al., 2009), which can then interact with endothelial cells. This, in turn, mediates 
granulatomous phlebitis and periphlebitis, the morphological hallmark and initiating lesion of 
FIP (Kipar et al., 2005). 

100 In addition to the virus, the susceptibility of the individual infected cat to disease also 

plays a significant role and it has been shown that age, breed, gender, reproductive status 
and immune response influences the development of FIP (Pedersen, 2014b; Pedersen et al., 
2014). For example, the efficacy of early T cell responses critically determines the disease 
outcome in cats that have been experimentally infected with a virulent serotype 2 strain, 
105 FIPV 79-1146 (de Groot-Mijnes et al., 2005). Furthermore, there is individual variation in the 
susceptibility of a cat’s monocytes to FCoV (Dewerchin et al., 2005). Also, recently, single 
nucleotide polymorphisms in the feline interferon-y gene have been linked to both resistance 
and susceptibility to the development of FIP (Hsieh & Chueh, 2014). Clearly, unraveling the 
relationship between FCoV genotypes and phenotypes, and the complex interactions 
110 between the virus and host during the development of FIP remains a major challenge. 

One facet of this challenge is to determine the mutations that alter the tropism and 
virulence of FCoV. As a first step, this can be done by comparing the genomic sequences of 
viruses shed in the feces of healthy animals and viruses that predominate within tissue 
lesions of cats that have been diagnosed with FIP. This approach assumes that the most 
115 highly abundant genome in a population is responsible for a particular disease phenotype, 
which is, however, consistent with our current understanding of FIP epidemiology. Using this 
approach, a recent paper published by Chang et al. (Chang et al., 2012) provided evidence 
for an association between FCoV virulence and amino acid substitutions within the putative 
fusion peptide of the FCoV spike (S) protein. A more detailed examination of samples from 
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120 FCoV infected cats that did not have histopathological evidence of FIP, led Porter et al. 
(Porter et al., 2014) to conclude that these substitutions were indicative of systemic spread, 
rather than a virus that, without further mutation, is able to cause FIP. As the S protein fusion 
peptide is involved in the fusion of viral and cellular membranes during virus entry, it seems 
plausible that changes within this region may be linked to the tropism of the virus. 

125 Similarly, Licitra et al. (Licitra et al., 2013) were able to distinguish between FCoVs in 

cats with and without FIP on the basis of one or more substitutions in the amino acid 
sequence that comprises the furin cleavage site within the FCoV S protein. The authors 
demonstrated that these substitutions modulated furin cleavage and suggested that a 
possible consequence of the identified substitutions is an enhanced cleavability by 
130 alternative, monocyte/macrophage specific proteases. 

Finally, there have been many reports over the years of point mutations and indels in 
the accessory protein genes of FCoVs and claims that these may be linked to the 
development of FIP. Prominent amongst these are reports that truncating and non-truncating 
mutations in the ORF3c gene occur in a significant proportion but not all FCoVs associated 
135 with FIP (Chang et al., 2010; Pedersen et al., 2012). Flowever, the role of the FCoV 3c 
protein and any relationship to the development of FIP is still unclear. One view is that 
functional 3c protein expression is essential for replication in the gut but is dispensable for 
systemic replication. Thus, once the virus has left the gut there is no further selection 
pressure to maintain an intact 3c gene and mutations will accumulate over time. This 
140 interpretation does not exclude the possibility that the loss or alteration of the 3c protein may 
enhance the fitness of the virus in the monocyte/macrophage environment but this is not yet 
supported by any convincing evidence. Similarly, whilst the genes encoding the 3a, 3b, 7a 
and 7b proteins clearly have important functions that will impact on virus fitness (Plaijema et 
al., 2004), there is, as yet, no evidence that links specific mutations in these genes to the 
145 development of FIP. 
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In this study, we report the genome sequences of six FCoVs; three from fecal samples from 
healthy cats and three from tissue lesion samples from cats with confirmed FIP. The six 
samples were obtained from cats that were resident at a single-site cat rescue and rehoming 
center in the UK. Our results support and extend previous studies that implicate S protein 
150 mutations in the pathogenesis of FIP. 
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Results 


FCo V RNA in fecal and tissue lesion samples 

155 As a first step, we amplified the FCoV RNA in fecal and tissue lesion samples. The seven 
amplicons for each of the fecal-derived RNA samples were of the expected size and were 
produced in approximately equal amounts. In comparison, there was greater heterogeneity in 
the amplicons obtained from RNA isolated from the FIP tissue lesions (Fig. 2). Specifically, 
there was more evidence of non-specific products and, especially in the case of amplicon 6, 
160 which encompasses the region of the genome encoding the S protein gene, there was less 
product than expected. In this context, we noted that the Ct values were generally higher (i.e. 
less viral RNA) for fecal samples than for samples from the FIP tissue lesions. The mean Ct 
values for the 65F, 67F and 80F fecal total RNA samples were 20.9, 16.9 and 29.0, 
respectively and the mean Ct values for the 26M, 27C and 280 tissue lesion samples were 
165 14.0, 21.5 and 15.0, respectively. One explanation for the difference in homogeneity of 

amplicons derived from fecal and lesional samples may be that the samples derived from 
lesions contain significantly greater amounts of FCoV subgenomic mRNA than the fecal 
samples, which would be expected to contain mainly virion particles. Also, 
immunohistochemistry identified a large number of macrophages with abundant viral antigen 
170 (i.e. N protein) within the lesions (data not shown). It is therefore very likely that the RNA 

extracted from the lesions contains much more viral mRNA than the feces. Thus, in the RT- 
PCR reactions that involve RNA from tissues, many of the oligonucleotide primers would 
bind to multiple templates, resulting in a more complex amplicon pattern. 

175 Assembly of genome sequences 

Using the methods described, we were able to obtain full genome coverage, with a 
minimum depth of 1000 reads at each base across the coding region (Fig. 3). We expect that 
with further optimization, it would be possible to obtain an acceptable level of coverage and 
depth for more than 4 complete genomes per single 316v2 chip. Similarly, it would also be 
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180 possible to obtain a very high density of reads for a single genome if, for example, the goal 
was to investigate the nature of the viral quasispecies in a particular sample. In our opinion, 
the limiting step in genome sequencing from clinical samples is the production of amplicons 
but, once this has been achieved, the downstream processing is relatively straight forward. 

Our approach was based upon the alignment of sequence reads to a de novo 
185 assembled target genome and this is dependent upon a relatively high similarity between 
samples. For example, in the case of the 65F, 67F, 26M and 280 samples, the percentages 
of reads that aligned to the 80F target genome were 96%, 95%, 90% and 95% respectively. 
Flowever, only 76.8% of reads from the 27C sample aligned to the 80F target genome. Thus, 
for the 27C sample, the de novo assembly method had to be used. De novo assembly is 
190 more time consuming and would not be a good approach if every sample had to be analyzed 
in this manner, as would be the case if they were highly divergent. It should also be noted 
that in our analysis, we have only compared genome consensus sequences where each 
position is defined by a single nucleotide. In reality, for any sample, many nucleotide 
positions are represented by a proportion of different nucleotides. In these cases, we have 
195 taken the majority nucleotide as the consensus nucleotide and have not attempted to 

delineate different populations in the quasispecies. This means that when comparing 

sequences, we are only able to identify mutations throughout the population of genomes and 
do not conclude that any or all of these mutations are found in a single genomic RNA. 

200 Phylogenetic analysis 

Phylogenetic analysis of the six clinical samples described here, based upon the 
conserved RNA-dependent RNA polymerase (RdRp), shows that they comprise a closely 
related cluster (Fig. 4). As reported by Barker et al. (Barker et al., 2013), there is no evidence 
that the samples derived from FIP or non-FIP animals represent genetically diverse co- 
205 circulating strains, which provides further support for the “internal mutation” hypothesis. 

Flowever, it is very difficult to exclude the possibility that at least some of the mutations that 
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may contribute to the development of FIP are present in a minor component of the infecting 
population, which is subsequently selected during virus replication in vivo. 

210 Comparison of FCo V genome sequences from clinical samples 

The genome sequences of the six FCoVs derived from fecal and tissue lesion 
samples were translated into two polyproteins (ppla and pplab), four structural proteins (S, 
M, N and E) and 5 accessory proteins (3s, 3b, 3c, 7a and 7b). We found that amino acid 
differences are located at 44 positions across an alignment of the six translatomes. At 21 of 
215 these positions, the differences fully or partially discriminate between the genomes derived 
from fecal (i.e. non-FIP) samples and from tissue (i.e. FIP) samples. More specifically, in 
these 21 positions one, or more, of the translatomes from the FIP samples displays an amino 
acid that is not found at the corresponding position in the translatomes from any of the non- 
FIP samples (Table 1). We also identified deletions in the 3c protein ORF of genomes from 
220 two of the FIP samples. 

The fully discriminatory differences we identified are located at two positions where a 
different amino acid is found in all three FIP translatomes compared to all three non-FIP 
translatomes. The first of these is at nucleotide position 23302 and corresponds to the 
methionine to leucine substitution identified by Chang et al. (Chang et at., 2012). Thus, our 
225 data support the idea that this substitution may be critical with regard to the pathogenesis of 
FIP. The second fully discriminatory substitution we identified, which was present in all of the 
FIP samples but none of the non-FIP samples, was at nucleotide position 23486 and resulted 
in an isoleucine to threonine substitution in the heptad repeat region 1 (HR1) of the S2 
domain in the FCoV S protein. The possible significance of this substitution is discussed in 
230 more detail below. 

Apart from the fully discriminatory substitutions described above, Table 1 shows a 
further 19 positions where one or two of the translatomes from the FIP samples displays an 
amino acid that is not found at the corresponding position in the translatomes from non-FIP 
samples. Without any further information, it is difficult to conclude that any of these 
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substitutions, alone or in combination, may be related to the development of FIP. However, 
they should not be ignored. For example, the substitutions resulting from mutations at 
positions 22528 and 22539 both lie within the furin cleavage motif that separates the SI 
(receptor-binding) and S2 (fusion) domains of the FCoV S protein. Both substitutions (R789G 
at P4 and R792S at PI, where P4 and PI designate positions in the canonical furin cleavage 
240 motif) would be predicted to alter furin cleavage activity. If this is the case, our results support 
the conclusions of Licitra et at. (Licitra et all, 2013) that identify the furin cleavage site as a 
potentially important region in the development of FIP. Alternatively, it could be argued that 
once the virus has acquired a tropism for the monocyte/macrophage, cleavage at the furin 
recognition motif may no longer be relevant to virus entry and mutations may accumulate 
245 due to a lack of selection pressure. For coronaviruses such as mouse hepatitis virus (MHV), 
cleavage at the canonical furin motif does not seem to be essential, at least for in vitro 
infectivity (Bos et at., 1997), and recent results suggest that activation of the coronavirus S 
protein fusion activity requires proteolytic cleavage at a different position in the S2 subunit 
(Millet & Whittaker, 2014; Wicht et at., 2014). Finally, Table 1 shows that two of the three 
250 translatomes derived from the FIP samples have a deletion in the 3c protein gene, which is 
not found in any of the non-FIP samples. In both cases, the deletion of 10 nucleotides leads 
to a translational frameshift that produces a 3c protein truncated eight amino acids 
downstream of the deletion site. 

In addition to amino acid substitutions that partially or fully discriminate between the 
255 genomes derived from non-FIP and FIP samples, our study has also identified a further 23 
amino acid substitutions that do not discriminate between non-FIP and FIP genomes. These 
are listed in Table 2. These substitutions will not be discussed in detail but it is, perhaps, 
worth noting that the majority are found either in the nsp3 protein or the amino-proximal SI 
region of the S protein. This suggests that these regions may represent the targets of 
260 particularly strong selective pressures. In the case of the SI region of the S protein, we 
speculate that this selective pressure is immunological and relates to production of 
neutralizing antibodies. The selective pressures that target the nsp3 protein are unknown. 
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For completeness, we also note that we identified a single G to T mutation in the 3’ UTR at 
position 28926 of the consensus sequence derived from the 26M sample that was not found 
265 in any other sample. 
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Discussion 


This study demonstrates an approach to the complete genome sequencing of FCoVs derived 
270 from clinical material that is achievable in a standard laboratory setting. It is based upon the 
generation of a virus-specific cDNA library using oligonucleotide primer pairs, followed by 
next generation sequencing (NGS) on a commercial platform, and downstream genome 
assembly using free software that will run on a personal computer. This approach was taken 
after we had failed to determine complete genome sequences of FCoV from clinical samples 
275 using a randomly primed cDNA library followed by NGS (Porter, PhD thesis, University of 
Bristol, 2014). In the study reported here, complete genome sequencing was achieved for six 
FCoVs using only seven primer pairs. Plowever, the samples we used were all collected 
within a few months at a single location, which means that they are less likely to have 
diverged, compared to samples taken at different locations over a longer time period. As the 
280 number of complete genome sequences for both serotype 1 and serotype 2 FCoVs 
increases, it may be possible to design a set of universal primer pairs that will only require 
minor optimization to successfully sequence any FCoV genome. In our own laboratory, we 
have shown that the seven primer pairs described here are able to produce amplicons of the 
expected size in approximately two-thirds of geographically divergent UK fecal samples 
285 collected over a 2 year period (unpublished results). 

In addition to confirming earlier findings, the most interesting result of this study is 
undoubtedly the identification of a consistent substitution of isoleucine with threonine at 
amino acid position 1108 in all FCoVs from FIP lesions compared to the fecal samples from 
healthy cats. This substitution is located within the heptad HR1 region of the S2 subunit of 
290 the FCoV S protein and could be interesting from two points of view. First, we note that this 
amino acid position has been identified as being located in a major T helper 1 epitope (I-S2- 
6, IGNITLALGKVSNAITTTSD) in a type 1 Japanese FCoV (KU-2) that was associated with 
FIP (Satoh et al., 2011). Obviously, further research will be required to ascertain whether 
there is a Thl epitope spanning this amino acid sequence in non-FIP associated FCoVs, and 
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295 to determine the quantitative or qualitative effect that may result from the isoleucine to 
threonine substitution. However, de Groot-Mijnes et al. (de Groot-Mijnes et at., 2005) have 
already drawn attention to the relationship between T cell depletion and the enhanced virus 
replication in FIP cases, although the mechanisms of T cell depletion are not yet clear. We 
suggest this is an area of FIP research that merits further study. For example, it would be 
300 interesting to compare IFN-y production by peripheral blood mononuclear cells taken from 
cats with FIP or healthy, FCoV-infected cats, and exposed separately to relevant HR1 
peptides, the sequences of which are derived from FIP and non-FIP associated FCoVs. 

Second, a quite different interpretation of the HR1 isoleucine to threonine substitution 
is that it may be related to the fusogenic activity of the FCoV S protein. This is because the 
305 substitution also lies within a stretch of 15 amino acids (NAITTI/TSDGFNTMAS) that are 
found only in alphacoronaviruses and are part of the heptad repeat structure that 
characterizes the HR1 region. Indeed, the isoleucine/threonine position constitutes a residue 
predicted to be located on the hydrophobic interface of the coiled-coil structure. Substitution 
of a hydrophobic residue with a polar, uncharged residue may, at least theoretically, 
310 significantly influence the intercalation of HR1 and HR2 regions, which is a necessary event 
during membrane fusion. It is also worth noting that a very recent study by Bank-Wolf et al. 
has identified a position two residues downstream of the isoleucine to threonine substitution 
where an aspartate residue was found in all examined non-FIP associated FCoVs (5 from 5) 
but was replaced by a tyrosine in a significant proportion (5 from 9) of the FlP-associated 
315 FCoVs (Bank-Wolf et al., 2014). Neither the isoleucine to threonine nor aspartate to tyrosine 
substitutions consistently discriminate between FIP and non-FIP FCoVs in the wider 
alignment of 29 type 1 FCoV S protein amino acid sequences that we have examined (data 
not shown) but, again, we think they may represent substitutions that are functionally related 
and could be relevant to the development of FIP. 

320 The comparative sequence approach taken by ourselves and others has identified a 

number of potentially interesting mutations in the coding sequences of non-FIP and FIP 
associated FCoVs. In the future, this approach can be extended, i.e. a larger collection of 
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well-defined clinical samples should be analysed, and it can be refined. For example, to 
distinguish mutations that may relate to the tropism of FCoVs from those that may relate to 
325 virulence, we suggest it would be important to obtain sequence data from a virus population 
that infects monocytes but is not able to replicate at a high level. Clearly obtaining 
appropriate clinical samples (e.g., blood monocytes from clinically healthy, FCoV infected 
cats) would not be easy but it would be very illuminating. The idea that a virus has to 
undergo sequential mutation in vivo in order to cause a specific disease is not unique to FIP 
330 (see, for example, the review on measles virus pathogenesis by de Vries et al. (de Vries et 
al., 2012)) but, we suggest, it deserves closer attention in a number of veterinary and human 
diseases. 

Nevertheless, this sequencing approach is ultimately limited. As has been stated 
before, compelling evidence that any specific mutation in the FCoV genome is important for 
335 the development of FIP will require the use of well-defined and characterized viruses 
produced by reverse genetics and a valid experimental model of FIP. With respect to reverse 
genetics, there are a number of robust reverse genetic systems available for coronaviruses, 
in general, and for particular strains of FCoV (namely the type 2 FCoV strain 79-1146 and 
the cell culture adapted typel FCoV strain Black) (Thiel et al., 2014). The pressing need, 
340 however, is for a robust reverse genetic system that can be applied to field strains of type 1 
FCoV. In our opinion, the bottleneck is not the molecular manipulation of the FCoV genome 
but, rather, the ability to propagate type 1 FCoVs in cell culture without extensive adaptation. 
Although there has been recent progress in the development of enterocyte cell lines that 
propagate type 1 FCoVs (Desmarets et al., 2013), we believe that a more robust cell culture 
345 system that allows for the propagation of high virus titres and the rescue of both mutated and 
non-mutated virus will be needed. To achieve this, identification of both the cellular receptor 
and attachment factors specific to type 1 FCoVs and the transduction of well-established, 
continuous, feline cell lines that can be easily maintained will be essential. 

The second required element, a valid experimental model of FIP, is also more 
350 challenging than it may, at first, appear. For example, many of the commonly used animal 
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models of FIP often involve intraperitoneal inoculation. If the natural course of FCoV infection 
involves sequential replication in the gut, low level replication in blood monocytes and high 
level replication in monocytes and macrophages, and each transition is associated with the 
selection of specific mutants, then this has to be reproduced in any valid experimental model. 
355 In a recent report, Tekes et at. showed that intraperitoneal infection of cats with a 
recombinant form of the FCoV 79-1146 strain robustly induced FIP (Tekes et at., 2012). 
Strikingly, the virus re-isolated from these cats demonstrated that there had been strong 
selection for a virus that reverted to encode an intact 3c protein. This is, in our view, good 
evidence that FIP results from an infection that involves initial replication in the gut. 

360 In summary, our results contribute to a better understanding of FCoV genomic 

mutations that may or may not be used as markers of the virus phenotype. It is also clear 
from the results that the relationship between the viral genotype and the development of FIP 
is complex. The further analysis of complete FCoV genomes in defined clinical samples, a 
robust reverse genetics system that can be applied to field strains of serotype 1 FCoV, and 
365 the development of valid experimental models of FIP will all be needed to throw further light 
on this relationship. 
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Methods 


370 Clinical samples and RNA extraction 

The samples selected for this study were fecal samples from three healthy kittens and 
post-mortem tissue lesion samples from three kittens with FIP. These samples were all 
obtained from a previously reported epizootic outbreak at a single-site UK feline rescue 
center (Barker et at., 2013). The three tissue lesion samples, designated here as 26M 
375 (mesentery), 27C (colonic lymph node) and 280 (omentum), were from cats F/FIP, Z/FIP 
and J/FIP in the previous study (Barker et al., 2013) and had been collected within 2 h of 
death, placed in RNAIater (Life Technologies) for 24-48 h at 4°C and then, after discarding 
the RNAIater, stored at -80°C. The diagnosis of FIP was confirmed by post mortem 
examination including histopathology and immunohistochemistry for the demonstration of 
380 FCoV antigen in lesions (Kipar et al., 1998). The fecal samples (65F, 67F and 80F, 
previously named #65, #67 and #80) were collected from the healthy cats within one 
month of euthanasia of the cats with FIP (Barker et al., 2013). Samples 80F and 27C were 
from cats that were littermates and were housed within the same pen. All three cats that 

provided fecal samples remained alive and without any clinical signs that could be 

385 suggestive of FIP for over 1 year post sampling. Fecal samples were stored at -80°C 

immediately after collection. 

Total RNA was extracted and purified from 20 mg of tissue with a NucleoSpin RNA kit 
(Macherey-Nagel) based on the method described by Dye and colleagues (Dye et al., 2008; 
Dye & Siddell, 2007). Briefly, 20 mg of each tissue sample was disrupted in a 2 ml tube by 

390 adding 500 pi lysis buffer containing 1% (B-mercaptoethanol (v/v) and a 5 mm stainless steel 

ball bearing. The sample was homogenized using a TissueLyser II (Qiagen) at 30 Hz for 2 
minutes and 470 pi of lysate was added to a filter column and centrifuged for 30 seconds at 
10,000 x g. A 350 pi aliquot of the filtrate was added to 250 pi of ethanol and run through a 
binding column to which DNase I was added to remove genomic DNA. Following multiple 
395 washes, the RNA was eluted into 50 pi nuclease-free water. The NucleoSpin RNA kit was 
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also used to extract RNA from fecal samples using a method based on that described by Dye 
et at. (Dye et at., 2008). A fecal suspension was produced by vortexing 0.5 g feces and 4.5 
ml phosphate buffered saline 5 times for 30 seconds. Subsequently, 100 pi of this 
suspension was centrifuged for 2 minutes at 10,000 x g, and the supernatant removed and 
400 added to 350 pi of lysis buffer containing 1% p-mercaptoethanol (v/v). The protocol 
described above (from the filter column) was then followed. 

Histology and Immunohistochemistry 

Formalin-fixed tissue samples (26M, 27C, 280) were routinely paraffin wax 
405 embedded and examined histologically to confirm the presence of typical FIP lesions. The 
immunohistochemistry served to demonstrate FCoV antigen within lesions, as described 
previously (Kipar et at., 1998). 

Quantitative RT-PCFt and virus-specific oligonucleotide primer design 
410 FCoV RNA was amplified from fecal and tissue samples using quantitative reverse 

transcriptase-polymerase chain reaction (qRT-PCR) as described previously (Dye et al., 
2008; Porter et al., 2014). Oligonucleotide primer pairs (Table 3) were designed to produce a 
total of 7 RT-PCR products (amplicons) spanning the entire coding region of the FCoV 
genome using the MacVector Primer 3 software package. Initially, the primers were designed 
415 based on the genome sequence of FCoV ClJe, a serotype 1 FCoV (Dye & Siddell, 2007). 
The primers were then compared to an alignment of 29 serotype 1 FCoV genome sequences 
(ClustalW, available upon request) and optimized to allow for sequence variation and 
compatibility of the primer pairs. All primers were synthesized by Eurofins MWG Operon. 

420 One-Step RT-PCR 

FCoV-specific primers were used to reverse transcribe and amplify the viral RNA 
contained in 2 pi of extracted total RNA using the Superscript III One-Step RT-PCR System 
with Platinum Tag High Fidelity (Life Technologies) as described by the manufacturer. 
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Briefly, a 50 pi reaction was set up on ice containing 2 pi RNA, 1 pi of 10 pM forward and 1 pi 
425 of 10 pM reverse primer, 25 pi 2X reaction mix (as supplied by the manufacturer), 2 pi 
Superscript III RT/Platinum High Fidelity enzyme mix and water to a final volume of 50 pi. 
The reaction was incubated at 50 °C for 50 minutes to allow cDNA synthesis, and then raised 
to 94'’C for 2 minutes, followed by 41 cycles of 94'’C for 15 seconds, 50-66°C (depending on 
the primer set) for 30 seconds and 68°C for 1 min/kb of product size. The annealing 
430 temperature for individual reactions was determined by the melting temperature of the 
primers used. The reaction underwent a final extension phase at 68°C for 7 minutes and was 
held at 4°C. For each amplicon, 5 pi of the PCR product was separated on a 1% agarose- 
TBE gel to confirm the PCR product size and to estimate the amount of DNA by comparison 
with standards. The PCR products were then pooled in approximately equimolar amounts 
435 and purified using Agencourt AmPure XP beads (Agencourt AMPure XP PCR Purification, 
Beckman Coulter), following the manufacturer’s protocol, and eluted in nuclease-free water. 

Next generation sequencing 

Purified, pooled amplicons were sequenced at the University of Bristol Genomics 
440 Facility using the Ion Torrent platform (PGM with the 316v2 chip). A targeted, virus-specific 
cDNA single-end read library was produced. Briefly, DNA was fragmented using the Ion 
Xpress Plus Fragment Library Kit, ligated to Ion-compatible barcoded adaptors and size- 
selected for a target read length of 150-200 bases. The library was then amplified and 
purified using the Ion Plus Fragment Library Kit and the Agencourt AMPure XP Kit. The 
445 barcoded libraries were quantified and pooled in equimolar amounts using Bioanalyzer 
quantitation. Templates were prepared from the barcoded, pooled libraries using the Ion 
OneTouch 2 System. Routinely, four genomes were sequenced on a single 316v2 chip. 

Bioinformatics 

450 Sequence data were analyzed using bioinformatics tools including both de novo 

assembly (Trinity, http://trinityrnaseq.sourceforge.net/) and genome alignment (Bowtie2, 
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http://bowtie-bio.sourceforge.net/bowtie2/index.shtml) methods. Briefly, for samples 80F and 
27C, a de novo consensus sequence was produced from the FASTQ reads using the Trinity 
assembled components and the MacVector assembly project tool (Grabherr et al., 2011). In 
455 order to identify and correct possible errors in this assembly, the same FASTQ sequence 
files were then aligned to the assembled consensus sequence using Bowtie2. The 
alignments were visualized using the Integrative Genomics Viewer (IGV) and the consensus 
sequence manually corrected on the basis of the sequence reads. Subsequently, the FASTQ 
sequence reads for four samples (65F, 67F, 26M and 280) were aligned to the corrected 
460 80F consensus sequence using Bowtie2. Again, IGV was used to confirm each consensus 

sequence with regard to the relevant sequence reads. All of the assembled genome 
sequences were examined and confirmed to have the expected FCoV genome architecture 
and predicted ORFs. This workflow is illustrated in Fig. 5. For selected viral genes, the 
encoded protein sequences were derived and phylogenetic reconstruction was done using a 
465 neighbor-joining algorithm based upon an alignment generated by ClustalW (MacVector). 
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610 Tables 


Table 1. Amino acid substitutions that partially or fully discriminate between the genome 
sequences derived from non-FIP (fecal) and FIP (tissue lesion) samples. The positions of the 
substitutions are indicated as the position of the relevant mutation/substitution based upon 
615 an alignment of the six FCoV genomes analyzed in this study (Supplementary File 1). The 
amino acid positions in the non-structural replicase proteins (nsps) refer to pplab. 


Nucleotide 

position 

Protein 

65F 

67F 

80F 

26M 

27C 

280 

Amino acid 
position 

1758 

nsp2 

1 

1 

1 

V 

V 

1 

549 

1794 

nsp2 

G 

G 

G 

G 

G 

R 

561 

6553 

nsp3 

D 

D 

D 

G 

G 

D 

2147 

14727 

nsp12 

Y 

Y 

Y 

F* 

F' 

Y 

4872 

17883 

nsp14 

T 

T 

T 

1 

1 

T 

5924 

19277 

nsp16 

N 

N T 

N 

H 

H 

N 

6389 

21370 

S 

S 

S 

S 

A 

S 

S 

403 

21377 

S 

1 

1 

1 

1 

1 

T 

405 

22291 

S 

F 

F 

F 

F 

F 

L 

710 

22361 

S 

S 

S 

S 

1 

1 

S 

733 

22528 

S 

R 

R 

R 

G 

G 

R 

789 

22539 

S 

R 

R 

R 

R 

R 

S 

792 

22757 

S 

S 

S 

S 

F 

F 

S 

865 

23302 

S 

M 

M 

M 

L 

L 

L 

1047 

23486 

S 

1 

1 

1 

T 

T 

T* 

1108 

23589 

S 

K 

K 

K 

N 

N 

K 

1142 

24190 

S 

P 

P 

P 

P 

P 

S 

1343 

24298 

S 

E 

E 

E 

Q 

Q 

E 

1379 

25447 

3C 

T 

T 

T 

T 

T 

M 

165 

25580-25589 

3C 

(-) 

(-) 

(-) 

(+) 

(+) 

(-) 

Deletion 

27228 

N 

S 

S 

S 

L 

L 

S 

170 

28759 

7B 

L 

L 

L 

F 

L 

L s 

198 
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The consensus nucleotide constituted 83% (26M) and 55% (27C) of the sequence reads at 
620 this position. 

t The consensus nucleotide constituted 60% of the sequence reads at this position. 

* The consensus nucleotide constituted 75% of the sequence reads at this position. 

§ The consensus nucleotide constituted 85% of the sequence reads at this position 

In all other cases, the consensus nucleotide constituted more than 96% of the sequence 
625 reads at a given position. 

(-) 3c protein gene was complete 

(+) 3c protein gene had a deletion. The deletions were: 26M and 27C, nt 25584-25593 (10 
nucleotides, AGGAGTTTAC). 
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Table 2. Amino acid substitutions that do not discriminate between the genome sequences 
derived from non-FIP (fecal) and FIP (tissue lesion) samples. The positions of the 
substitutions are indicated as the position of the relevant mutation/substitution based upon 
an alignment of the six FCoV genomes analyzed in this study (Supplementary file 1). The 
635 amino acid positions in the nsp proteins refer to ppl ab. 
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Nucleotide 

position 

Protein 

65F 

67F 

80F 

26M 

27C 

280 

Amino acid 
position 

813 

nsp2 

V 

1 

1 

1 

1 

V 

234 

1794/1795 

nsp2 

E 

G 

G 

G 

G 

R 

561 

2955 

nsp3 

A 

T* 

T 

T 

T 

A 

948 

3082 

nsp3 

R 


K 

K 

K 

R 

990 

3797 

nsp3 

Q 

Q 

H 

Q 

Q 

Q 

1228 

5218 

nsp3 

V 

A 

A 

A 

A 

V 

1702 

5337 

nsp3 

L 

M 

M 

M 

M 

L 

1742 

6804 

nsp3 

A 

S 

A 

A 

A 

A 

2231 

8939 

nsp5 

K 

K 

N 

N 

N 

K 

2942 

14564 

nsp12 

A 

A 

P 

P 

P 

A 

4818 

15185 

nsp13 

1 

1 

L 

1 

1 

1 

5025 

20509/20510 

S 

A 

P* 

L 

P 

P 

A 

116 

20584 

S 

D 

N 

N 

N 

N 

D 

141 

20861 

S 

S 

S s 

N 

S 

S 

S" 

233 

20864 

S 

R 

Q 

R 

Q 

Q 

R 

234 

20866 

S 

1 

L 11 

1 

1 

1 

L 

235 

21275 

S 

R 

R # 

Q 

R 

R 

R 

371 

21467 

S 

1 

T 

T 

T 

T 

1 

435 

22151 

S 

R 

K“ 

K 

R 

R 

R 

663 

22332 

S 

1 

l ft 

M 

1 

1 

1 

723 

27273 

N 

L 

Q 

Q 

Q 

Q 

L 

185 

27873 

7A 

H 

ytt 

H 

Y 

Y 

H 

6 


The consensus nucleotide constituted 71% of the sequence reads at this position. 
640 t The consensus nucleotide constituted 72% of the sequence reads at this position. 

* The consensus nucleotide constituted 71% of the sequence reads at this position. 

§ The consensus nucleotide constituted 68% of the sequence reads at this position. 

11 The consensus nucleotide constituted 85% of the sequence reads at this position. 

11 The consensus nucleotide constituted 71% of the sequence reads at this position. 
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645 # The consensus nucleotide constituted 60% of the sequence reads at this position. 

** The consensus nucleotide constituted 59% of the sequence reads at this position. 

n The consensus nucleotide constituted 56% of the sequence reads at this position. 

The consensus nucleotide constituted 78% of the sequence reads at this position. 

In all other cases, the consensus nucleotide constituted more than 96% of the sequence 
650 reads at a given position. 
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Table 3. Sequences of oligonucleotide primers used in this study. All oligonucleotides are 
shown as 5‘ to 3‘ sequences. The positions of the oligonucleotides are given relative to the 
genome of FCoV Cl Je [GenBank: DQ848678]. 


655 


Name 

Amplicon 

Sequence 

Position in 
ClJe 

Length 

(nt) 

FI 69 

1 

TAGGAACGGGGTTGAGAG 

169-186 

18 

R6507 

1 

GTGCGAGAACRGCCTTAA 

6456-6467 

18 

F5562 

2 

G TTT G A AYT C ACG TG G Y C ATT 

5511-5531 

21 

R7490 

2 

G ARGT CTT CAT CWG AACCCAC 

7441 

21 

F6943 

3 

GCT AGT GTT AGAAATGTCTGTGTT 

6932 

24 

R12466 

3 

AAAAG CCCT ACT AACGT G GTC 

12421 

21 

FI 2224 

4 

CAT CCTGCAATT GAYGGATTG 

12173 

21 

R18105 

4 

T CCGGGT ACAT GT CT ACGTT A 

18054 

21 

FI 7830 

5 

GATT GGT CCATT GT GT ACCC 

17782 

20 

R20131b 

5 

AAARCCTT CCG AT G ACG AGGT 

20080 

21 

FI 9786b 

6 

GT ATT AAG RAG AT G GTT G CC A 

19735 

21 

R26007 

6 

AT AACCG CAT GAGAAAAG G CT 

25793 

21 

F24798 

7 

TAAAATGGCCKTGGTATGTGT 

24601 

21 

R29508 

7 

T AG CT CTT CCATT GTTG G CTC 

29167 

21 
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Figure legends 


Figure 1. Genomic organization of feline coronavirus. Genomic ORFs are shown as boxes. 
Only pplab is shown as a translation product of the genomic RNA. The non-structural 
proteins nspl —11 are translated from ORFIa (dark grey) and translation of the ORFIb 
proteins (nspl 2-16) occurs following -1 ribosomal frameshifting (RFS). Nsp 11 is not 
depicted as it represents a short (nine amino acid) carboxyl extension of nsplO. Nsp9, 
single-stranded RNA-binding protein (ssRBP); nspl 2, RNA-dependent RNA polymerase 
(RdRp); nspl 3, helicase (Hel) and NTPase; nspl4, 3'—>5' exoribonuclease (ExoN) and A/7- 
methyltransferase (A/7-MT); nspl 5, uridylate-specific endonuclease (NendoU); nspl 6, 2-0- 
methyltransferase (2-OMT). 

Figure 2. Agarose gel electrophoresis of PCR amplicons 1-7 for tissue lesion samples 26M, 
27C, 280 and fecal samples 65F, 67F, and 80F. 

Figure 3. Coverage of sequence reads across the assembled FCoV genomes from fecal and 
tissue lesion samples. Sequence reads were aligned against the de novo assembled 80F 
target genome for the feces-derived samples 65F (A), 67F (B), 80F (C) and tissue lesion- 
derived samples, 26M (D), 27C (E), and 280 (F). 

Figure 4. Phylogenetic analysis of the core RNA-dependent RNA polymerase domain of 
nspl2 (amino acids 4503 to 4807 in pplab, Supplementary file 1) for FCoV strains 
sequenced in this study and selected FCoV genome sequences. The phylogenetic tree was 
constructed by the neighbor-joining method from an alignment made with ClustalW 
(MacVector). GenBank accession numbers are shown for all sequences. Bootstrap values 
exceeded 60% at all nodes. 
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Figure 5. Sequence assembly workflow for FCoV genomes. Fecal samples 65F, 67F, 80F 
and tissue lesion samples 26M, 27C, 280. ORFs, open reading frames. 

Supplementary File 1. A nucleotide alignment of the six type 1 FCoV consensus genome 
sequences determined in this study. The alignment was produced using ClustalW 
(MacVector). 
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